Multiple Classifier Systems Incorporating Uncertainty
نویسنده
چکیده
The inclusion of uncertain class information into multi classifier systems (MCS) is the central theme in this thesis. A multi classifier system relies on multiple base classifiers, each of which is trained on a separate view of the problem at hand. Combining their answers will lead to a more accurate final decision. An example would be emotion recognition, with decisions based on observations of the mouth, the eyes or the pitch of the voice. Traditionally in classification one sample is associated with exactly one class, for example anger. But, in practical applications, such a hard distinction is not warranted; instead a sample should have soft class memberships, thus being associated fuzzily with multiple classes at the same time. The inclusion of this uncertain information into various, but isolated building blocks of a MCS has been tackled by a great many researchers. This thesis places these approaches in the greater MCS context and assesses their utility. Remaining problems are identified and in many cases a solution is proposed. Bayesian probability is the most obvious tool for modelling class uncertainty, but perhaps the Dempster-Shafer theory of evidence, fuzzy logic or fuzzy sets, or even a distribution of opinions are much more viable in a classification context. These formal uncertainty theories, as well as some others, are assessed regarding their aptitude to support the core flavours of uncertainty in MCS, as identified in this work: vagueness, imprecision, and certainty. For the very fitting Dempster-Shafer theory, practical applications are reported. Some base classifiers have been extended to be trained on and answer with uncertain labels: learning vector quantisation, self-organizing maps, and most notably support vector machines (SVMs). The latter are an already very powerful breed of classifiers, and based on the idea of duplication, the underlying optimisation problem could be altered to accept fuzzy labels. Obtaining soft outputs from the binary SVMs is not trivial, but complete solutions are provided for the Onevs-Rest and One-vs-One multiclass decomposition architectures. Experiments do confirm the effectiveness of the fuzzy trained machines over their hard trained
منابع مشابه
Using Fuzzy LR Numbers in Bayesian Text Classifier for Classifying Persian Text Documents
Text Classification is an important research field in information retrieval and text mining. The main task in text classification is to assign text documents in predefined categories based on documents’ contents and labeled-training samples. Since word detection is a difficult and time consuming task in Persian language, Bayesian text classifier is an appropriate approach to deal with different...
متن کاملUsing Fuzzy LR Numbers in Bayesian Text Classifier for Classifying Persian Text Documents
Text Classification is an important research field in information retrieval and text mining. The main task in text classification is to assign text documents in predefined categories based on documents’ contents and labeled-training samples. Since word detection is a difficult and time consuming task in Persian language, Bayesian text classifier is an appropriate approach to deal with different...
متن کاملImproving Accuracy in Intrusion Detection Systems Using Classifier Ensemble and Clustering
Recently by developing the technology, the number of network-based servicesis increasing, and sensitive information of users is shared through the Internet.Accordingly, large-scale malicious attacks on computer networks could causesevere disruption to network services so cybersecurity turns to a major concern fornetworks. An intrusion detection system (IDS) could be cons...
متن کاملIncorporating prior knowledge induced from stochastic differential equations in the classification of stochastic observations
In classification, prior knowledge is incorporated in a Bayesian framework by assuming that the feature-label distribution belongs to an uncertainty class of feature-label distributions governed by a prior distribution. A posterior distribution is then derived from the prior and the sample data. An optimal Bayesian classifier (OBC) minimizes the expected misclassification error relative to the ...
متن کاملPush Back Design in Two-element Deposits Incorporating Grade Uncertainty (RESEARCH NOTE)
Push back design as a complex task, is one of the major steps in the open pit mines planning. Push backs can be generated by varying economic factors such as commodity price, mining cost, processing cost, etc. Another important issue in generating push backs is grade uncertainty, which can cause the problem be more complex. Conventional methods of push back design ignore grade uncertainty. To o...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010